This is a reasoning reranking agent model built upon Qwen-2.5-7B for the paper REARANK: Reasoning Re-ranking Agent via Reinforcement Learning. The model is trained on reranking dataset built from only 179 queries using GRPO to perform reranking task, the codebase is at https://github.com/lezhang7/Rearank

image/png

image/png

Downloads last month
95
Safetensors
Model size
7.62B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for le723z/Rearank-7B

Base model

Qwen/Qwen2.5-7B
Finetuned
(2327)
this model
Quantizations
2 models